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Abstract. Motivated by non-equilibrium phenomena in nature, we study dynamical systems 
whose time-evolution is determined by non-stationary compositions of chaotic maps. The con- 
stituent maps are topologically transitive Anosov diffeomorphisms on a 2-dimensional compact 
Riemannian manifold, which arc allowed to change with time — slowly, but in a rather arbitrary 
fashion. In particular, such systems admit no invariant measure. By constructing a coupling, we 
prove that any two sufficiently regular distributions of the initial state converge exponentially 
with time. Thus, a system of the kind loses memory of its statistical history rapidly. 
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1. Introduction 

1.1. Motivation. Statistical properties of dynamical systems are traditionally studied in a sta- 
tionary context. Let us elaborate briefiy, discussing only discrete time for simplicity. Suppose 
is the set of all possible states of the system. Given the state x„ G at some time n > 0, 
the state of the system at time n-|- 1 is assumed to be either (i) Xn+i = Txn, where T : Ai ^ Ai 
is an a priori specified map used at every time step or (ii) Xn+i = Tn+iXn, where the maps 
Tj : — 7- are drawn randomly and independently of each other and of Xq, . . . ,Xi-i from 
a set of maps T according to a distribution t]. Now, suppose the initial point xq has random 
distribution fi: Prob(a;o G i?) = l^^iE), for all measurable sets E G Ai. By stationarity we mean 
that Prob(a;„ G E) = n{E), for all n > 0, for all measurable sets E (Z Ai. This condition 
translates to ii{T~^E) = fi{E) in case (i) and to fj,{T~^E) drj{T) = ^{E) in case (ii). In either 
case the measure n is called invariant. The reader may verify that these definitions result, in- 
deed, in a strictly stationary process in that all finite dimensional distributions are shift-invariant: 
Prob(xfci+„ G -El, . . . , Xk^+n e Em) = FToh{xki G -Ei, . . . , Xk^ G E^), for all choices of the indices 
and of the measurable sets. Let / be a measurable function on Ai, which represents a quantity 
whose observed values f{xn) at different times one is interested in. Given an invariant measure 
one may, for example, study the statistical behavior of the sums '^^Iq fi^i) of observations, 
making use of the fact that (/(a;„))„>o is a stationary sequence of random variables. 

A key ingredient in obtaining advanced statistical results on interesting systems is chaos, that 
is to say the dynamical complexity due to sensitive dependence of the trajectories {xn)n>o on 
the initial point Xq. In this paper we initiate a program to free ourselves from the standard 
constraint of stationarity, advocating the following view: 

Much of the statistical theory of stationary dynamical systems can be carried over 
to sufficiently chaotic non- stationary systems. 
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The deliberately imprecise statement above is proposed as a guideline and challenge instead 
of a theorem. We believe that a result obtained for a strongly chaotic stationary system quite 
generically has a non-stationary counterpart if the corresponding non-stationary system continues 
to be sufficiently chaotic. 

The inspiration for undertaking the program stems from non-equilibrium processes in nature 
where it is often unfounded or simply false to assume that an observed system is driven by 
stationary forces. For example, it is conceivable that an ambient system governed in principle by 
measure preserving dynamics is, for all practical time scales, in a non-equilibrium state, so that 
the subsystem actually being observed is better modeled separately in terms of non-stationary 
dynamical rules. The remark is by no means limited to situations of physical interest alone, but 
seems to lend itself rather universally to apphed sciences. Second, from a purely theoretical point 
of view it appears very restrictive to focus only on stationary dynamical models. 

In order to advance the program in a meaningful way, we need a concrete model to work 
with. Deferring technical definitions till later, let the state x„, G of the system at time n be 
determined by the action of the composition T„ o ■ ■ ■ o Ti on the initial state xq € A^, where 
each map Ti : M. M. describes the dynamical rules at time i. For us, the constituent maps 
Ti are topologically transitive Anosov diffeomorphisms on a 2-dimensional compact Riemannian 
manifold A^, which form a prime class of nontrivial chaotic maps. It is clear that some additional 
control is needed; for instance, an alternating sequence of an Anosov diffeomorphism T and its 
inverse T~^ would yield T„ o ■ ■ ■ o Ti = id for even values of n, which does not result in chaotic 
dynamics. To that end, the maps Tj are here assumed to evolve slowly with time but otherwise 
they may do so in a rather arbitrary fashion. We point out that the maps Tj need not be randomly 
picked, there is no assumption of stationarity, and for large n the map T„ may be far from the 
map Ti. Even if all the maps Tj preserved the same initial measure, so that the random variables 
Xn were identically distributed, the process {xn)n>o would typically fail to be stationary. 

We call such compositions non-stationary and think of them as descriptions of dynamical 
systems out of equilibrium. 

We prove in this paper that the system at issue loses memory of its initial state exponentially. 
More accurately, assume Xq has either distribution jj} or jj^ and call and /i^, respectively, 
the corresponding distributions of x„. Our main result states that if /i^ and /i^ are sufficiently 
regular, then the difference / / ~ J f df^n tends to zero at an exponential rate with increasing 
n, provided / is a suitable test function. This type of weak convergence is natural due to the 
invertibility of the dynamics: the supports of the measures yU^ and /i^ will never overlap unless 
they did so initially. Instead, they tend to concentrate increasingly on unstable manifolds due to 
the contracting direction of the maps Tj and then wind wildly around the phase space Ai due to 
the expansion on unstable manifolds. Hence, one cannot hope to identify ever-increasing portions 
of and /x^ unless one first integrates against a test function that possesses some regularity 
along stable manifolds. In spite of the convergence of the difference /i^ — /i^ for arbitrary initial 
measures /i^ and yU^, in general the limit measures limn^oo f^n do not exist individually even in 
the weak sense. It is more appropriate to think that all regular measures are attracted by a 
moving target in the space of measures. 

Finally, let us point out that in the real world, where observations take place on finite time 
scales, one is not interested in the excessively distant future. To underline this, the results here 
are finite-time results, in which the sequence Ti, . . . , T„ is assumed to be known only up to some 
finite value of n. The lack of infinite future leads to certain technical problems to be discussed 
and dealt with below. 
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In [12] analogous results were obtained for uniformly expanding and piecewise expanding 
maps. The situation of the present paper is markedly more complicated because our Anosov 
diffeomorphisms have a contracting direction. Some steps in this direction were taken in [2], 
where mixing for certain arbitrarily ordered compositions of finitely many toral automorphisms 
was established. There are other studies which contain at least some elements that in spirit 
are not very far from our setting. In [3,4] compositions of hyperbolic maps — all close to each 
other — were studied and limit theorems proved. An abstract operator theoretic approach for 
obtaining limit theorems was described in [9], with applications to piecewise expanding interval 
maps. Moreover, symbolic dynamics of non-stationary subshifts of finite type was considered 
in [1]. An extensive literature on random compositions of maps exists. It will not be reviewed 
here, as the present paper concerns quite a different type of questions. Nevertheless, some of the 
techniques developed below should be useful in the context of random maps as well. 

1.2. Structure of the paper. In Section 1.3 we describe the precise setting of the paper. In 
particular, we explain what kind of compositions of maps we are interested in and discuss our 
standing assumptions. After that, the main result of the paper. Theorem 2, is formulated. 
Section 1.4 introduces some basic concepts needed throughout the paper. The Introduction ends 
with Section 1.5, which discusses what the author perceives as the most important contributions 
of the paper, including a technical version of Theorem 2. 

In Section 2 we define finite-time stable and unstable distributions and stable foliations needed 
to keep track of the dynamics with appropriate accuracy. We also prove quantitative results 
concerning the distortion effects of the dynamics. Subsequently, we are able to define in a 
meaningful way finite-time holonomy maps which satisfy useful bounds. 

In Section 3 we formulate the central result of the paper — the Coupling Lemma. It is then 
used to prove Theorem 4, which subsequently implies Theorem 2. The Coupling Lemma itself is 
proved in Section 4, which is the most technical part of the paper. 

To maintain the flow of the discussion, some key technical facts have been separated from the 
main text and presented in the appendices. They are cited in the text as needed. Appendix B 
is of special interest; there we prove the uniform Holder regularity of the finite-time stable and 
unstable distributions introduced in Section 2. 

1.3. Compositions of Anosov diffeomorphisms. Fix Q G N. For each 1 < g < Q, let 
Tq : Ai ^ Ai he a. topologically transitive Anosov diffeomorphism on the 2-dimensional 
compact Riemannian manifold Ai with metric d embedded in an ambient space M*^ ^. The 
Riemannian volume is denoted by m. The map Tg admits an invariant Sinai-Ruelle-Bowen 
(SRB) measure, fig, which is mixing; see for example [5]. In general, such a measure is not 
absolutely continuous with respect to the Riemannian volume. Let Ug = 'B{Tg,eq) be disk 
neighborhoods of small radii > in the topology. Now, pick a finite sequence (T„) of 
Anosov diffeomorphisms such that 

Tn eUg Wn e Ig = {ng-i,ng], (1) 

where = no < rii < ■ ■ ■ < uq. For technical reasons, also set T„ = Tq for all n > uq. We 
assume that the intervals Ig are long enough: 

\Iq\ =ng- Ug-i > Nq, (2) 



Such a diffeomorphism is topologicahy conjugate to an automorphism of the torus. 
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where the numbers A^^, 1 < g < Q, will be assumed suitably large. We will be interested in the 
statistical properties of the compositions 

7; = T„ o ■ ■ ■ o Ti n<nQ. (3) 

The maps Tq serve as successive guiding points which the sequence iTn) follows in the space of 
Anosov diffeomorphisms, spending a sufficiently long time Nq in each neighborhood Uq before 
moving on to W^+i- We also write Tn,m = ■ ■ ■ o for m <n. 

Each Tq admits a unique continuous invariant splitting of the tangent bundle: for each a; G A^, 
TxJ^ = 3, © Eq,xy where the 1-dimensional linear spaces E^'^ depend continuously on the 
base point x, D^TgE'^^ = ~ and DJIaEf,^ = E^ ~ . In fact, in our 2-dimensional setting, 

the dependence on the base point is C^"*"" for some a > 0, because the so-called bunching 
conditions [10] are satisfied. The families E^ = {E^.^.} and E^ = {E^ .^.} are called the unstable 

and stable distributions of Tq, respectively, and their integral curves are called unstable and 
stable manifolds of Tq, respectively. By continuity, the angle between E^ and E^ at each point is 
uniformly bounded away from zero. The maps also have continuous families of unstable cones, 
and stable cones, {C^ ^.j. These can be defined by setting 

Cq,x = + v' -.v-e El,, G E^,, < aq\\v^}, 
Cq,x = K + ^^ ■■ e El,, E El„ \\v-\\ < aq\\v^\\}, 
for some constants a, > such that 
(CI) D,f-Cl, C {0} U intC;~„^ and D,f-^Cl, C {0} U intC;~_„^ if n > Pq, 
(C2) \\D,f^v\\ > CqA^^M if ve CI, and \\D,f-^\,\\ > 5,A^||i;|| if v E C^,, 

for constants Pq > 1, < Cq < 1, and > 1. 
We make the following standing assumptions: 

(AO) Pg = 1 in condition (CI) above. 

(Al) D^TCl, C {0} U int C^tx and D^T-^C^, C {0} U int C^^-i, for all T E Uq. 

(A2) There exist constants < Cg < 1 and Ag > 1 such that, if each Tj E Uq for a fixed q, 

WD^TnvW > CqA^\\v\\ ifvE CI, and \\D,r~'v\\ > CqA^\\v\\ iivE C^,. 
(A3) D^TCl, C {0} U intq+i^r, if T e Uq+i and D^T-^C^^^^, C {0} U intC^^.,, if T G Uq. 
(A4) The numbers Og can be assumed small. 

Convention 1. From now on we will assume that Q reference Anosov diffeomorphisms Ti, . . . ,Tq 
have been fixed. When we say that a result does not depend on the choice of the sequence {Ti), 
we mean that the result holds true uniformly for all finite sequences (Tj)"^-^ of any length nq, 
provided (1) and Assumptions (A) are satisfied and the numbers Nq appearing in (2) are large 
enough. 

Given < 7 < 1, we say that a function / : — > M is a 7- Holder continuous observable, if 

We are now in position to state our main theorem, which is reminiscent of weak convergence 
of measures in probability theory. 
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Theorem 2 (Weak convergence). There exist constants < rj < 1 and C > 0, for which the 
following statements hold. Let dfi^ = p^dm (i = 1,2) be two probability measures, absolutely 
continuous with respect to the Riemannian volume m, such that are strictly positive and r]- 
Hdlder continuous on M.. If f is continuous, then 



where Af{n) = o(l). Given < 7 < 1, there exist constants < 6^^ < 1 and = C^/{p^,p^) > 
such that, if f is '^-Holder, then Af{n) = C^BfO"!^ with Bf = C(sup/ — inf /) + |/|^. In either 
case, the various constants do not depend on the choice of the sequence (Ti), in particular its 
length uq, as long as the earlier assumptions hold and the numbers Ng appearing in (2) are large 
enough. Among the constants only depends on the densities p\ and in fact it only depends 
on the Holder constants of In p* . 

In other words, if / is continuous, the difference between the two integrals f °Tn dp} is 
eventuaUy arbitrarily small, assuming there are sufficiently many maps in the finite sequence 
{Ti) of Uq maps. The latter means that at least one of the intervals Iq in (2) is sufficiently 
long, and consequently uq is large. What is more, the rate of convergence is exponential, if / is 
Holder continuous. By approximation, one can get an o(l) estimate also for general continuous 
densities p*. 

Let us emphasize once more that despite such convergence or pairs of measures, it does not 
make any sense to speak of a limit measure, because the maps T„ keep evolving with time — 
possibly drifting very far from Ti. Furthermore, all observations in our theorems are restricted 
to times not exceeding (the arbitrarily large but finite) uq. 

Theorem 2 remains true for much more general, SRB-like, initial measures. It is enough that 
each measure p* can be disintegrated relative to a measurable partition such that the partition 
elements W & are smooth unstable curves with respect to the cones {C"^.} with uniformly 
bounded curvatures and the conditional measures p^\W have regular densities. See below for 
details. 

In our formulation of the theorem, the convergence rate 9^ is constant. The latter depends on 
the reference diffeomorphisms Tg, 1 < q < Q. A sharper, variable, convergence rate that depends 
also on the time interval Iq that n belongs to, can be deduced from the proof. 

We finish the section by discussing Assumptions (A) and how they could be relaxed. 

Assumption (AO) is one of convenience; we could as well assume that Tp^ o ■ ■ ■ oTi is sufficiently 

close to T^«, but have opted for a streamlined presentation. Assumptions (Al) and (A2) state 
that compositions of maps belonging to Uq have similar hyperbolicity properties as powers of Tq. 
The following lemma is proved after a few paragraphs: 

Lemma 3. Assumptions (Al) and (A2) are satisfied if Sq is sufficiently small. 

Assumption (A3) guarantees that hyperbolicity prevails when a transition from Uq to Wg+i 
occurs. The first part of (A3) could be relaxed by replacing the map T G Wg+i by sufficiently 
long compositions 7^ = T„ o ■ ■ ■ o Ti of maps with each Tj G Uq+i. given a sufficiently large 
rq > 0, D^TnCg^^ C {0} U int Q+i t^^^. if > and Tj G Uq+i for 1 < z < n. The assumption 
is then satisfied, for example, if Uq fl W^+i 7^ and if Sq is small, for all q. However, Uq and 
Uq^i need not overlap or even be close to each other, as most vectors in the tangent space T^Ai 
get eventually mapped by 7^ = T„ o ■ ■ ■ o Ti into C^_^^^ j-^^ if each Tj G Uq+i and if e^+i is small. 




n < Uq, 
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Similar remarks hold for the second part of (A3). This way, the sequence (T„) used to build up 
the compositions (3) might, without affecting our analysis, involve occasional long jumps from 
one neighborhood Uq to the next, as long as the number of steps \Iq\ spent in each neighborhood 
Uq, see (2), is sufficiently large. 

Assumption (A4) means that the cones can be assumed narrow. This is not restrictive for 
our purposes either, as it follows from (C2) that arbitrarily narrow cones can be treated by 
considering sufficiently long compositions of maps in a given Uq with a sufficiently small Sq. 

Recapitulating, it would be adequate to assume that the properties above hold eventually, for 
sufficiently long compositions of maps, and in this case the assumptions are very natural and 
easily fulfilled. For technical convenience and notational ease, we assume from now on that all 
the nice properties hold immediately, after the application of just one map. 

Proof of Lemma 3. We will prove the claims for unstable cones, going forward in time. Similar 
arguments work for the stable cones, by reversing time. 

(Al): Suppose T G Uq. By the chain rule D^TCl^ = Df^^{Tf-^)D^fqCl^, where D^fqC^^ C 
{0} U intC^-^. By the continuity of the cones with respect to the base point and the fact that 

Df^^{Tf~^) = 1 + 0{eq) 2, we have D^TCl^ C {0} U intQ^^, provided Sq is sufficiently small. 
Compactness guarantees that Eq can be chosen independently of x. 

(A2): For each i and x, D^Ti = D^Tq+Ei^x, where supj = 0{eq). We can bound \\DxTn— 

D^T^W < C{N)eq. If Sq is sufficiently small, we have ||D^7^f || > \\D.^f^^v\\ - C{N)eq\\v\\ > 
|CgA^||t;|| for V G Cg,j,. Now assume = N{q) is so large that |CgA^ > 1. Here A^ depends 
neither on x, on v, nor on the choice of the maps T, G Uq. Let us set A, = (|CgAf The 
uniform estimate llDj:'^^'^!! ^ Cq\\v\\ holds with some Cq = Cq{N) for 1 < n < A^. Now, assume 
n = kN + I, < I < N. Then ||Dj.7^t>|| > Cq\\DxTkNv\\ > CgA^''^||t>||. Thus, we can take 

Cq = Cq/A^. a 

1.4. Unstable curves with smooth measures. We call a smooth curve W G Ai unstable 
with respect to a,} if its tangent space at each point x G is contained in the unstable cone 
C^j,, i.e., T^W C Stable curves are defined similarly. Let W{x, y) C W denote the subcurve 
of W whose end points are x,y E W. The length |iy | of a curve W is given by 

\W\ = / draw, 
Jw 

where m^y stands for the measure m\Y on W induced by the Riemannian metric. Also, let n{W) 
stand for the maximum curvature of W: if u{x) is a unit tangent vector oiW&tx depending 
smoothly on x, then k{W) = sup \\u ■ Vu\\. 

It is convenient to consider curves of bounded length and curvature only. Hence, we introduce 
two length caps, L and i < L, and a curvature cap K, and say that a smooth curve W is standard, 
ii i < \W\ < L and if K{TnW) < K for all n > 0. If \W\ > L, we can always "standardize" 
it by cutting it into shorter subcurves. If an unstable curve is of length less than £, it will 
eventually grow under the application of the sequence (Tj), such that \TnW\ > £ for sufficiently 
large n. The dynamics also flattens unstable curves, such that K,{TnW) < K for all sufficiently 
large n, even if k{W) > K. We will confirm these last two facts in the following. Finally, there 



^Here it is understood that M is embedded in the ambient space M^^ and that Z?^ ^{TT^^ ^) acts between the 
linear subspaces ^.M and TtxM of M*^. 
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are no discontinuities which would introduce more short curves under the dynamics by cutting 
longer ones. Taking these considerations into account it is quite natural to commit to the mild 
constraint that all curves are standard curves to begin with. This will help keep the somewhat 
technical discussion as clear as possible. 

A standard pair (W,iy) (w.r.t. {Q^.}) consist of a standard unstable curve (w.r.t. {0^,^.}), W, 
and a probability measure, u, on W. The measure u is assumed absolutely continuous with 
respect to m^y on W with a density, p, that is regular in the following sense: for some global 
constants Cr > and ?7r G (0, 1] to be fixed later ^, 

\\np{x)-\np{y)\<a\W{x,y)\^^ (4) 

for all X, y G W. In particular, 

inf p ~ ' 

which implies inf p > ^ji^e"*"''^'" > ■|^e~*"""^'* > 0, since J^^ pdmw = 1- Moreover, inf p < If 
W C W, we obtain by using the previous facts that 



Hence, ii D = e^^-^^"' and W, W" C W, 



\W"\ ~ \W'\ ~ \W"\ ' 

Formally, a standard family is a family Q = {(Wa,i^a)}a& of standard pairs together with 
a probability factor measure Xg on the (possibly uncountable) index set 21 and a probability 
measure pg satisfying 



pg{B) = / iy^{BnW^)dXg{a) 

J 21 

for each Borel measurable set i? C A^. The measure pg is supported on Uq,VFq, and 



= / fdfJ'g= / f{x)duo,{x)dXg{a 

JM J%JWr, 



for each Borel measurable function / on A^. In Theorem 4 we assume that a standard family is 
associated to a measurable partition. 

A standard family can, for example, consist of just one standard pair {{W, p)} and the Dirac 
point mass factor measure 5w- Another natural example of a standard family {{Waii'a)}a&% 
is obtained by considering an Anosov diffeomorphism and taking as {VFajaga a measurable 
partition consisting of unstable manifolds of bounded length and letting the Riemannian volume 
induce the factor measure and the conditional measures Va- 



1.5. Main contributions. A technical version of our main result is the following theorem. It 
states that for reasonable initial distributions pg and ps^ the images TnPg and TnPs converge 
exponentially in a weak sense. 



Cr has to satisfy the condition in Lemma 9 and -q^ is determined in Lemma 22. Both depend on the reference 
sequence Ti, . . . , Tq, but not on the choice of (Ti). 
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Theorem 4. There exist constants C > and < < 1, and < A < 1, such that the following 
holds. For any standard families Q and £, any 7 > 0, and any '^-Holder observable f ^, 

< BfO:;, n < UQ, 

where, 

5/ = C(sup/-inf/) + |/|^ and = max(^9, A^)^/^ 

The various constants do not depend on the choice of the sequence (Ti), in particular its length uq, 
as long as the earlier assumptions hold and the numbers Nq appearing in (2) are large enough. 

As a consequence of Theorem 4, we prove the earher Theorem 2, which is stated in terms of 
less technical notions and is closer in spirit to the weak convergence of probability theory. 

The proofs of Theorems 4 and 2 rely on a coupling method that has its roots in probability 
theory. It was carried over to the study of dynamical systems by Lai-Sang Young [11,13] who 
used it to prove exponential decay of correlations for Sinai Billiards and uniqueness of invariant 
measures for randomly perturbed dissipative parabolic PDEs. Bressaud and Liverani [5] also 
used coupling to give explicit estimates on the decay of correlations for Anosov diffeomorphisms. 
The present paper takes advantage of a version of Young's coupling method introduced by Dmitry 
Dolgopyat and Nikolai Chernov [7, 8] . 

A considerable amount of work is devoted to obtaining uniform bounds, which is more involved 
than in the case of iterating a single map. A central issue is that the finite sequences (Ti, . . . , T^q) 
of maps that we consider do not possess stable and unstable manifolds, because defining such 
objects requires an infinite future and an infinite past, respectively. Thus, we have to resort to 
artificial, finite-time, foliations that describe the dynamics sufficiently faithfully but are by no 
means unique. Moreover, in the single map case the regularity properties of the foliations of the 
manifold into stable and unstable manifolds play an important role. Our construction should 
therefore also yield regular foliations. In addition, the amount of regularity must not depend 
on the choice of the sequence (Ti, . . . ,T„q) (as long as Q > 1 and the maps T^, 1 < g < Q, 
have been fixed and the earlier assumptions are satisfied), since the goal is to prove the uniform 
convergence result in Theorem 4. 

At the heart of Dolgopyat 's and Chernov's method lies the Coupling Lemma (corresponding 
to Lemma 13). In its proof, one constructs a special reference set called the magnet. By mixing, 
any standard pair will ultimately cross the magnet as if it was attracted by the latter. Once 
two standard pairs cross the magnet, parts of them can be coupled to each other using the 
stable foliation. In this paper, we generalize the idea by considering time-dependent magnets 
and time-dependent, finite-time, foliations for the coupling construction. 

2. Distortions and holonomy maps 

2.1. Stable foliations W". As pointed out above, there is no well-defined sequence of stable 
foliations associated to the finite sequence (Ti, . . . ,T„q) of maps. A way around this is to try 
to augment the sequence with a fake future consisting of infinitely many maps — in our case 
Tn = Tq for n > Uq — and to consider the uniquely defined stable foUations of the resulting 
infinite sequence of maps. This sequence of stable foliations naturally depends on the chosen 
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Given a sequence (Ti), it is in fact enough to assume that / is Holder continuous along the finite-time stable 
leaves associated to that particular sequence; see Section 2. 
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future and it is not a priori clear whether they have very much to do with the finite-time dynamics 
(1 < n < ng) which is the only thing we are interested in. 

For a sequence (Tj) satisfying the earlier assumptions, we can define a sequence of stable 
distributions on the manifold A^, by pulling back the stable distribution Eq^ of Tq. More 
precisely, let us first define E"^ = Eq ^ for n > rig + 1 and then 

E: = Z}T„+,.T-ii?^+\,, < n < ng. 

With this definition, 

DxTn^^E:^-' = El^^,, n>m>l. 

Assumptions (Al) and (A3) guarantee that E^ C {0} U intC^ .^ for n + 1 G Jg and I < q < Q- 
By Assumption (A4), the angle between E!^ and E^^ can be assumed uniformly small, for n & Iq 
and I < q < Q. 

The distributions -E" above are the tangent distributions to the stable foliations W" of the 
sequences (Ti)j>„. If Wg .j, is the stable leaf of Tq at x, then = Wg 3, for n > ng + 1 and 

W: = T-+\>V^+\,, 0<n<nQ. 

Notice that y G W" if and only if limAr^oo d{TN,n+ix, TN,n+iy) = 0. 
For technical reasons, we also define = E'^^ for n < and then 

F: = D^-.,T„FJr,^, l<n<nQ. (6) 

Assumptions (Al) and (A3) guarantee that C {0} U intQ^, for n E Iq and 1 < q < Q. By 
Assumption (A4), the angle between F^ and E'^^ can be assumed uniformly small, for n E Iq 
and 1 < q < Q- The distributions F"- are in fact the unstable distributions of the sequence 
(Tj) augmented with the past = Ti for i < 0. They serve as Holder continuous reference 
distributions that allow us to accurately compare different unstable vectors. 

In Appendix B we show that the distributions F" and E" for all n are uniformly Holder 
continuous. 

2.2. Distortion. It is necessary to control the distortion and growth of curves under maps T. 
Given a curve W we denote by JJwT the Jacobian of the restriction of T to W. If v is any 
nonzero tangent vector of W at x, then 

JwT{x) = " „"„ " . 

||f II 

Lemma 5 (Growth of unstable curves). Fix 1 < q < Q and let Ti G Uq for each i. If W is an 
unstable curve with respect to {Cq^}, setting Aq = sup^\\DxTq\\ + Eg, 

CqAg\W\ < IXWl < Aq\W\. (7) 

IfW, TiW, . . . , TnW are stable curves with respect to {C^ ^,}, then 

\W\ 



ITnWl < 



Proof Since \TnW\ = jj-^^ dmr^w = JwJwTndmw = Jy^^\\DxTnVx\\ dmw{x), where is a unit 
vector tangent to W at x, it suffices to observe that C^A^ < ||D^7^t>^|| < A^ in the "unstable 
case" and ll-DxT^'^^xll < tAtt in the "stable case". □ 
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Lemma 6 (Curvature of unstable curves). Fix 1 < q < Q and let Ti G Uq for each i. There exist 
Ki and, for any K' > 0, K2{K') and n^{K'), such that 



K{TnW) < 



Ki, n > n^, 
K2, n > 0, 



holds if W is an unstable curve with respect to {C^^} and k,{W) < K' . Notice that Ki < K2 is 
independent of K' . 

Remark 7. We can now fix some K' and set K = K2{K') in the definition of standard pairs. 
In particular, this means that any unstable curve W with length between £ and L and curvature 
k{W) < K' is a standard curve. 

Proof of Lemma 6. Let W be an unstable curve and 7 its parametrization by arc length, such 
that u{x) = ■jit) G C^.j. with x = ■y(t). Note = 1. The curvature of W at x is the length of 

7(t) = u{x) ■ Vm(x). 

Setting V = DTnU, v{y) = V{x)/\\V{x)\\ is the unit tangent of TnW at y = TnX. The curvature 
of TnW at y is thus obtained from 

v{y) ■ Vv{y) = Dv{y)v{y) = \\V{x)\\-'Dv{y)Drn{x)u{x) = \\V{x)\\-'DMy)M^)- 
Here the chain rule Dx{v{y)) = Dv{y)D%i{x) was used. Now 

D^{v{y))u{x) = D{\\V{x)\\~'^V{x))u{x) = \\V{x)\\~^DV{x)u{x) + V{x)D{\\V{x)\\~^)u{x) 
= \\V{x)\\~^DV{x)u{x) + V{x) {-\\V{x)\\~^V{x) ■ DV{x)) u{x) 
= \\V{x)r'DV{x)u{x) - \\V{x)\\-hiy) {v{y) ■ DV{x)u{x)) , 

such that 

v{y) ■ Vv{y) = \\V{x)\r' {DV{x)u{x) - v{y) {v{y) ■ DV{x)u{x))) , 

or compactly 

vVv= (DVu -v{v DVu)) . (9) 

Notice that DVu — v {v ■ DVu) is the component of DVu orthogonal to v and hence ||-DV^m — 
V [v ■ DVu) II < ||DV^m||. Furthermore, as Duu = u ■ Vm, which we recognize to be the curvature 
of at X, we have 

DVu = D'^rn{u,u) + DTniu-Vu). (10) 
Using Lemma 26 and ||Z?7^m|| > CqAg||n||, we see from (9) and (10) that 

\D^Tn{u,u)\\ , \\DTn{u-Vu)\\ 



\v-Vv\\ < W\\~'^\\DVu\\ < 



\DTnU\\^ WDTnuf (11) 



< {C,K)-^D'Tn\\^ + (C,A")-iC#||n ■ Vu\ 



Fix an such that {Cqh.'^)~^C# < 1. Iterating (11), 

(aAf)-2sup,^.jD2r^ll ^ 
<%nmW) < i-(c^Ap) + {{CqK^,r'c^f<r,w). 

A uniform bound maxo<;<Ar k{TiW) < a + h ■ n{W) is also obtained, so we are done. □ 
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If W carries a measure u with density p, then TnW carries the measure 7^i^ whose density, 
which we denote TnP, is 



For controlhng the regularity of such densities, we have the following result. 

Lemma 8 (Distortion bound). Fix 1 < q < Q and let Ti G Ug for each i. IfT~^W is a standard 
unstable curve with respect to {C^x) f^^ all < n < N and if x,y G W , then 



In 



Jwr-\x) 



JwT-\y) 



<Cd,,|l^(x,|/)|, n<N. 



Here C^^q > is a constant that do not depend on W or the choice of [Ti). 

Proof. We first prove that the distortion factor of any map T G is close to that of Tg. To this 
end, let W be an unstable curve with respect to {Qa,}, x G W, and v a unit vector tangent to 
W aX X. Then 



JwT-\x) - JwTg-\x) 



< C\\Df-.Jg - DT-iccT\\Jwfg-\x) < CegJwfg-\x). 
Next, let 7 parametrize W{x, y) according to arc length. Because 



JwT-\^it)) 



\D,^t)T-mt)\ 



< 



dt 

|(D,,)T-S(t)) 



D,^t)T-^^{t) d 



\D^^t)T-^^{t)\\ dt 



< sup||D^2— 1|| + sup||L'^T~^|| ||7 

X X 

and because the curvature ||7|| < K for all standard unstable curves, 

\Wix,y)\ ^ 



\\nJwT-\x)-\nJwT-\y)\ 



dt 



{\nJwT-\^{t)))dt 



^(^..)l -|j-^T-H7(t)) 
JwT-^{l{t)) 



dt 



<C^jW{x,y)\, 



where C^^g is independent of the choice of T. The desired estimate follows. Indeed, writing 

= (7;,„_j+i)"^x, y-^ = {Tn,n^j+i)~'^y, and = (7;,„_j+i)~^Vr (with Tn,n+i = id). 



JwT-\y) 



In 



n~l 



<Y,CA,q\W-^{x-\y-^) 



Moreover, by (7), \W-^ {x-\y-^)\ < C-^k~^\W{x,y)\. 



□ 
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2.3. Image of a standard family. 

Lemma 9. Fix 1 < q < Q and let Ti G Uq for each i. Let Q = {W, v) he a standard pair with 
respect to {Qa,} and assume that satisfies 

For n > In^/lnAg, denote by Wi the (finitely many) standard pieces of the image %iW after 
it has been standardized by cutting into shorter pieces and split the image measure Tn^ into the 
sum where Ui is a probability measure on Wi and ^jCj = 1. Then each (PFj, z/j) is a 

standard pair w.r.t. {C^^}. 

Proof. We only need to check that the density, pi, of Ui is regular. For x G Wi, Pi{x) = JwiT~^{x)- 
p{T~^x) / Ci. Thus, for any pair x,y E Wi, 



lnpi{x) - \npi{y)\ < \ lnp{X ^x) - lnp{X ^y)\ 



^^Jw.T-\x) 



<C,\W{T~'x,T~'yT + C^,^m{x,y)\ 

We used Lemma 8 and also Wi{x, y) = Tn{W{T~^x, T~^y)) together with Lemma 5. □ 

Thus, Qn = {{Wi, Ui)} is a standard family equipped with the factor measure \g„{i) = Ci. More 
generally, if ^ is a standard family, Qn obtained by processing each standard pair in a similar 
fashion is a standard family. 

2.4. Holonomy maps. A holonomy map is a device needed in the coupling construction for 
coupling some of the probability masses on different points. Let Wi and W2 be two unstable 
curves (w.r.t. {C]*^}) connected by the stable foliation W°. In other words, for each point 
X G Wi the leaf >V° intersects W2 and conversely for each point y G W2 the leaf >V° intersects 
Wi. We assume that the curves Wi are close enough and not too long, so that the connected 
pairs (x, y) G Wi x W2 are uniquely defined by demanding that the connecting leaf be shorter 
than a small number Iq < 1. Then the holonomy map h : Wi — )■ W2 is defined by sliding along 
the leaf: hx = y. Since the images TnWi are connected by the stable foliation W", one can define 
the holonomy map h„ = 7^ o h o : TnWi — )■ TnW2. 

Remark 10. Notice that if the curves Wi carry measures Vi that are compatible in the sense that 
V2 = hi^i, then the images TrWi carry compatible measures: 7^z/2 = h„7^i^i. This will guarantee 
in the following that once some of the masses on two points have been coupled to each other, they 
remain coupled. 

The holonomy map h is said to be absolutely continuous, if the measure h~^mw^ is absolutely 
continuous with respect to the measure ■ In this case the Jacobian, which measures distortion 

under the holonomy map, is defined as the Radon-Nikodym derivative J^h = ^ ■ The 

change-of-variables formula for integrals is dm\;\r^{y) = J'h{x)dm];\r^{x) with y = hx ^. For any 



^Given a Borcl set A C Wi, wc have JY^j^dmw2 ~ (h ^'mw2)iA) ~ '^^^drn^r^^^ dmwi- 
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X e Wi, we have hx = 7^ ^h^T^a;. It is elementary to check that if h„ is absolutely continuous, 
then h inherits this property via the identity 



Jw^Tnihx] 



■ Jhn{TnX) 



(12) 



By reversing the argument, we see that if is absolutely continuous for some m, then h„ is 
absolutely continuous and (12) holds for all values of n > 0. 

Lemma 11 (Absolute continuity of the holonomy map). Let h be as above. It is absolutely 
continuous. Moreover, there exist constants ci > 1 and < fi < 1, independent of the curves Wi 
and W2 and the choice of the sequence {Tif^^i, such that 



\\YiJ\Yn{TnX)\ < Ci/i" 

holds for X G Wi and < n < uq. In particular, e~'^^ < J\i < e*^^ . 



(13) 



As a curiosity, (13) continues to hold for n > riq since T„ = Tq. In particular, the precise value 
of J'h(x) could be obtained as the limit lim„^oo ^ r"(hi) • However, we only care about n < nq. 
What is important above is that ci and yU do not change when the lengths of the intervals Iq in 
(2) and hence the value of uq are increased arbitrarily. 



Proof of Lemma 11. Denote x"^ = TnX and y = Yix loi x E Wi, and = TnWi for i = 1,2. 
We also write ?/" = Tny = h^x". Since T„ = Tq and W"~^ = Wq .j. for all x, for all n > hq, 
we know the following: h inherits absolute continuity from h^^, (12) holds for all n > as 
explained above, and lim„_j,oo i7h„(a;"') = 1. Therefore, with the aid of the chain rule J^wTn{x) = 
J'w^-^Tn{x"'~^) ■ ■ ■ j7vi/oTi(x°), we conclude that 



n 

n>m 



Jw^Tn+l{x''' 



By Assumption (A3), we may use Lemma 27 on each of the time intervals Iq. Since |ln2;| < 
max(z — 1, — 1) for all 2 > 0, we see that (27) implies (13): 



|ln:7h„(a;'")| < 



n>m 



In 



J^W^Tn+l{x'^) 



<C'- 



□ 



Lemma 12 (Regularity of the holonomy map). There exist < ?7h < 1 and Ch > 0, such that 
the following holds. Let Wi and W2 be standard unstable curves connected by the stable foliation 
W° as above. For Xi,X2 G Wi such that \Wi{xi,X2)\ < I, 



Proof. Denote x" = 
= TnW2. By (12) 



|lnjh(xi) -lnjh(x2)| < Ch|^^l(a:l,X2)|''^ 
TnX for all X in = TnWi. We also set yi 



\njh{xi) -\njh{x2)\ < 



In 



Jwi%n{x2) 



+ 



In 



Jw2'Tm{y2) 
Jw2'Tni{yi) 



= hxi, yi = Tnyi, and 
J2\^nJh^{xT)\, 



i=l,2 
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for all m. The last sum can be bounded with the aid of (13). Using the chain rule J^w'T'ni^) 
J'iY„-iTn{x^~^) ■ ■ ■ Ji^oTi^x^), Lemma 8 and the bounds (7), 



In 



(X2 



< E 

0<'?<Q-l 

0<9<Q-1 

0<g<Q-l 

< Yl c'd,,+iA;;r""^---Ar"'^^Ariivi(xi,x2) 

0<g<Q-l 



< ( max Cda ]\Wi(xi, X2)\ > ( max A 

Vl<<?<Q / " ^ \l<q<Q 



< 



1<9< 

maXi<q<Q Cd,q 



l<g<Q 

max A, 



1- (maxi<g<QA,) ' V<^<Q 

CA"«|iyi(Xi,X2)|. 



\WAxi,X2) 



Similarly, for any m, 

J^Wi'T'miXi) 



In 



Jwi'Tm{x2) 



<CA"'\Wi{xi,X2)\ and 



In 



Jw2%n{y2) 



Jw2'Tm{yi 



<CA^\W2{yi.y2)\. 



Notice from the definition of C and A that they are independent of the curves Wi and of the 
sequence (Tj). Because \W2{yi,y2)\ < sup^^f^^^,^^^ Jh ■ \Wi{xi, X2)\, 



In 



JWx'Tm{x2) 



In 



Jw2'Tm{y2 



< CA™(1 + e^^)|iyi(xi,X2)| = C2Pr\WAxi,X2) 



Jw2^m{y\) 

en [f^ = \Wi{xi,X2)\, A"^ = \Wi{xi,X2)\^''^' 
\njh{xi)-\njh{x2)\ < 2cifi"' + C2A"'\Wi{xi,X2)\ < (2ci + C2)|iyi(xi, Xs)!^-^'^^/!^"''! 



Finally, choose m = , Then //"^ = \Wi{xi,X2)\, A"^ = \Wi{xi, X2)\^''^^^''^, and 



□ 



3. Coupling Lemma and the proof of Theorems 4 and 2 



Let {W, v) be a standard pair and dv = pdm^r. We will be interested in densities of the form 
rp where r : — j- [0, 1] is a function. These can be considered as portions of the measure u. In 
practice, we will replace W by the rectangle 1^ = x [0, 1] with base W and du by the measure 
dv = dv ® dt, where dt denotes the Lebesgue measure on [0,1], and look at the subdomain 
{{x,t) &W : <t < t{x)} of W. Introducing the rectangle facilitates bookkeeping. 

A standard family Q = {(Wa,Va)}a£<ii can similarly be replaced by ^ = {(Wa,Va)}ae^- The 
measure fig induces canonically a measure fig on UaWa- A map T on A4 extends to a map 
on X [0, 1] by setting T{x,t) = (T(x),t) and all observables on A4 extend to observables on 
M X [0, 1] by setting f{x,t) = f{x). 

We are now in position to state the following key result. 
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Lemma 13 (Coupling Lemma). Consider two standard families Q = {{Wa,i^a)}a& o-nd £ = 
{(Wj3,h'i3)}j3^<s- There exist an almost everywhere defined bijective map 6 : UaWa — )■ UjsWis, 
called the coupling map, that preserves measure, i.e., &{fig) = fi£, and an almost everywhere 
defined function T : UaWa — ?■ N, called the coupling time, both depending on the sequence (Tj), 
such that the following hold: 

(1) Let {x,t) G Wa, a G 21, and 6(x,t) = (y, s) G Wjs, /3 G 03. Then the points x 
and y lie on the same leaf, say W , of the stable foliation >V°. If n > T(x,t), then 
the distance of the points TnX and Tny along the leaf TnW of the stable foliation W" 
satisfies \TnW{TnX,Tny)\ < £oA"~^*-^'*^. Here io > has been introduced earlier and 
A = maxi<<y<Q < 1. 

(2) The exponential tail bound 

figiT >n)< Ct?9t 
holds for uniform constants Cr > and 'dr ^ (0, 1). 
Proof of Theorem J^. We use the coupling between Q and E given in the Coupling Lemma: 



(14) 



f oTndfXg 



M 



f oTndfl£ 



M 



if °Tn){x,t) dfLg{x,t) 



A1x[0,l] 



if °'T'n){y,s) djis{y,s) 



A1x[0,l] 



(/ O Tn){x,t) dflg{x,t) 



Mx[0,l] 



if 



oTnO Q){x,t) dfLg{x,t) 



Mx[0,l] 



if °Tn- f oTnOQ) dflg 



A^x[0,l] 



/ if °Tn - f oTnO Q) dflg + if O % - f O % O Q) dflg . 

JT<nl2 JT>n/2 



By (14), 



if oTn- f oTnOQ) dflg 



T>n/2 



<CT(sup/-inf/)i9: 



n/2 
T • 



On the other hand, assume T{x,t) < n/2. Then |(/ o ?; - / o ?; o e)(x, t)| < |/|^(4A""'^('^''*))^, 
by the Coupling Lemma, such that 



if oTn- f oT„oQ) dflg 



'r<n/2 

Since io < 1, the proof is complete. 



< QfLX'''^' 



□ 



Proof of Theorem 2. First notice that both of the measures yU* can be disintegrated using a 
suitable measurable partition of Ai so that we almost obtain two standard families, with the 
nuisance that the Holder constants of the logarithms of the conditional measures possibly exceed 
Cr in (4). In the latter case we need a finite waiting time = N{p^, p^), depending on the Holder 
constants of Inp*, until the densities regularize and yield true standard families; see the proof 
of Lemma 9. For 7-Holder observables the result then follows immediately from Theorem 4, 



with the above waiting time giving the constant C^(p^,p^ 



If / is only continuous. 



we fix an arbitrarily small e > and, by Stone-Weierstrass theorem, pick a 7-Holder such 
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that II/-MI00 < e. Then \J^foTndfi'- Jj^foTndfi^\ < C^{p\ p^)Bf^e:; + 2e < 3e if 
n>\n{e/C^{p\p^)BfJ/\ne^. □ 

4. Proof of the Coupling Lemma 

4.1. Outline of the proof. The idea of the proof is to construct special tiny rectangles, called 
magnets, which can be thought to attract unstable curves. Mixing guarantees that a small 
fraction, say 1 percent, of any high enough iterate of any unstable curve will ultimately lie on a 
magnet. Once two unstable curves from two different standard families cross a magnet, we are 
able to couple a fraction of their masses by connecting some of their points lying on the magnet 
with very short stable manifolds. This has to be done with due care, because the resulting 
coupling has to be measure preserving. 

The process is then repeated recursively, and so the construction of the coupling map and 
the coupling time function T is recursive. It can be shown that after a fixed finite number of 
iterates a fixed fraction of the remaining masses can always be coupled, so that the measures on 
the unstable curves can be 'drained' at an exponential rate. 

Since we are dealing with compositions of diffeomorphisms from the sequence (Tj) rather than 
iterates of a single diffeomorphism, we need to use time-dependent magnets. For n G Iq, Tn G Uq, 
and the magnet to be used should reflect the structure of the reference diffeomorphism Tq. In 
our time-dependent, finite time, setting it is not even a priori clear what coupling should mean. 
We choose to construct a coupling via the stable foliations W" that vary from one point in time 
to the next. As mentioned earlier, these foliations are artificial in the sense that they depend 
on the artificial future T„ = Tq for times n > uq although in reality we only consider the 
compositions 7^ = T„ o ■ ■ ■ oTi for n < uq. We then have to pay special attention to uniformity: 
our convergence rates, etc., should depend neither on the particular value of uq nor on the 
particular finite sequence (Ti, . . . , T^^) as long as the reference automorphisms (Ti, . . . , Tq) have 
been chosen and the earlier assumptions on (Ti, . . . , T„q) are being respected. 

4.2. Magnets and crossings. In this subsection 1 < q < Q is fixed for good. Unstable curves 
and standard pairs are to be understood as being defined with respect to the cone family {Q 3,} 
with q fixed. 

Consider the Anosov diffeomorphism Tq. A 'rectangle', 9^ C A^, is a closed and connected 
region bounded by two stable manifolds and two unstable manifolds of Tq. These are called the 
s- and u-sides of the rectangle, respectively. Recalling that 3. denotes the stable leaf of Tq 
at X, we also assume that the size of the rectangle in the stable direction satisfies |Wq ,j,nSH| <^ Eq. 

We say that an unstable curve W crosses the rectangle properly, if 

(PI) W crosses completely, i.e., W HDl contains a connected curve W connecting the two 

s-sides of the rectangle, and 
(P2) both components of W \ W are of length strictly greater than £/10, 

both hold. In other words, a crossing is proper if the curve crosses the rectangle completely and 
there is a guaranteed amount of excess length beyond each s-side of the rectangle. Here £ is 
the lower bound on the length of a standard curve. Finally, an unstable curve W crosses the 
rectangle super- properly, if (PI), 

(P2') both components of W \ W are of length strictly greater than i/5, and 
(P3) each x & W (l^i divides the curve W^^, fl in a ratio strictly between 1/10 and 9/10. 
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all hold. Thus, in a super-proper crossing there is more guaranteed excess length than in a proper 
crossing and the curve also stays well clear of the u-sides. 

Lemma 14. There exists a finite set of rectangles, {"iPi^ : 1 < ^ < k^], such that each standard 
unstable curve crosses at least one of the rectangles super-properly. 

Proof. Every closed standard curve crosses some rectangle 91 super-properly. Since crossing a 
rectangle 9^ super-properly is an open condition in the Hausdorff metric, the set Wf^ of all closed 
standard curves crossing 91 super-properly is an open set. The collection formed by all the sets 
Uy\ is an open cover of the space of closed standard curves equipped with the Hausdorff metric. 
The latter space is compact. We can therefore pick a finite subcover and the corresponding 
rectangles. □ 

We now pick arbitrarily one of the rectangles DV'. This special rectangle, that we will denote 
by Dig, will be called a magnet. It will serve as a reference set on which points will be coupled. 

Lemma 15. Fix n > 1. By taking Sq sufficiently small (depending on n) the following holds. 
If W is an unstable curve and T^W crosses 9Kq super-properly, then TnW crosses properly, 
provided each Ti ElAq. 

Proof. For any point x, we have the bound d{TnX,TgX) < C{n)eq. □ 

If W is an unstable curve and n is fixed, let W^^, i E 3, he the connected components of 
TgW nDlq, that correspond to super-proper crossings. That is, each W^^ is a subset of a longer 
curve j C T^W which crosses dig super-properly and VF^ ^ fl 9^^ = W^^. 

Lemma 16. There exist a subrectangle Q3g C ^Hg and a number s' > 1 such that the following 
holds. Assume that W is an unstable curve that crosses a rectangle 9^'^ properly, n > s' , and 
T^D^*^ n 95g 7^ 0. Every component ofT^^i'' fl Dig that intersects fBg intersects W^^ for precisely 
one value of the index i e3. 

In words, each intersection of T^^^ with 53g yields a super-proper crossing of T^W , as long 
as n is large enough. 

Proof. We assume that the magnet D^^ is so small that the leaves of the unstable foliation of Tg 
are almost parallel lines on y{g. This can be guaranteed by considering only sufficiently small 
rectangles in the proof of Lemma 14. Now, choose Q3g C 9^^ to be a rectangle whose distance 
to the u-sides of d\q is sufficiently large; say each x G divides the curve Wg ,^ fl £Hg in a ratio 

between 1/5 and 4/5. As Tg is one-to-one, the components of TJ^dK^ H^Hg are disjoint. Assuming 
s' is large, these components are very thin strips, almost aligned with the unstable foliation. Pick 
such a component and assume that it intersects QSg. It is a safe distance away from the u-sides of 
the magnet. Inside this component lies a piece V of the curve T^W . The piece V has to extend 
to a super-proper crossing of 9^g, because W crosses D^'^ properly and because rii is large. Thus, 
V is actually a subcurve of one of the W^^. □ 

Lemma 17. There exist numbers d" > and s" > 1 such that if [W, u) is a standard pair and 
n > s", then iy{f-"iU,^,Wl^)) > d" . 

In other words, the fraction of W that will cross the magnet super-properly after n steps 
is at least d". 
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Proof. Fix a k such that W crosses 91^ properly. This is possible by Lemma 14. By Lemma 16, if 
a component of T^£H'^ n£Hg intersects the subrectangle it is crossed by the curve component 
j for precisely one value of the index i G J. In this case let denote the former component of 
f^m'^n^q. Thus is only defined for a subset a«8, C J of indices. We have z/(f -"(UigaVT^^J) > 
z/(f-«(Uga,,W^„',)) = ^(T-"(U,ga,,W^„^ n<'j)) > c/i,(f -"(U,ea,,<:D) = > 
c/ig(Uig3,3^^H^'j n ?Bg) = c/ig(Tg"^H'= H > f/i,(9^'=)/i,(!Bg) if n > s" and s" is large. The last 
step in the estimate follows from mixing of the invariant measure fig. The third step relies on 
the absolute continuity with bounded Jacobians of the holonomy maps of Tg as well as on the 
regularity of u and of the conditional measures of fig on the unstable leaves of Tg. To finish, 
notice that d" = fig(y{'')fig{^q) > 0, since the interiors of 9^'^ and ?Bg are nonempty. □ 

For an unstable curve W, let Wn^i now be the connected components of TnWn'Dlg, labeled by i, 
that correspond to proper crossings. That is, each W„ j is a subset of a longer curve Wn^i C TnW 
which crosses 9^^ properly and Wn^i HDlg = Wn,i. 

Corollary 18. There exist numbers > and s'q > 1 such that the following holds. Let Ti G Ug 

for each i and {W, v) he a standard pair. If n > s'q, then u{T^^ (UiWn^i)) > rfg. 

Proof. Fix m > In^/lnAg. By Lemma 9, {TmW^Tm^) can be broken into a finite collection 
of standard pairs {Wj.Vj) such that TmW = UjWj and 7^i^ = '^jCjUj, where < Uj < 1 and 
Cj = 1. For each j, by Lemma 17, there is a finite collection of disjoint (minimal) subcurves 

Vj^k C Wj such that T^ Vj,k crosses the magnet ^Hg super-properly. Moreover, Xlfc — 
and, by Lemma 15, the images Tm+s",m+iVj,k cross the magnet dig properly. We also have 
Tm,T^{LljUkVj^k) = Cj PjiVj^k) > d" . Let us relabel the collection of the subcurves T^^Vj^k C 
W by Ui. We have so far shown that v{VJiUi) > d" and that Tm+s"Ui crosses the magnet ^Rg 
properly. 

We are almost done, but we still need to truncate each Ui to a subcurve Ui so that Tm+s"Ui = 
Tm+s"Ui n yig and argue that i^(U;f/;) > aiy{UiUi) for some constant a > 0. Such a trun- 
cation amounts to choosing the subcurve V^jt = TmUi of Vj^k C. Wj so that Tm+s",m+iVj,k = 
Tm+s",m+iVj^k n dig. Now i^jiVj^k) > ctUjiVj^) follows from two observation: 

• |Vj-,A;| is bounded uniformly away from zero, because Tm+s",m+iVj^k crosses Dig completely. 

• \Tm+s" ,m+iVj^k\y^q\ IS bouudcd Uniformly from above, because \Tg"Vj^k\^q\ was assumed 
to be as small as possible (for a super-properly crossing curve). Therefore \Vj^k \ Vj^k\ = 
l^'.fc \ '^m+s" m+i^il bounded uniformly from above. 

Indeed, it is implied that | Vj-,fc|/|V^-,fc| > for some a' G (0,1] so that, by estimate (5), 
^j{Vj,k)/iyj{Vj,k) > D-W for all j, k. ' 

Notice that only s" affected the size of Eg. This happened when Lemma 15 was used. □ 

4.3. Time-dependent magnets. For the rest of the section, let (Tj) be a sequence of the form 
described in the Introduction, which is not confined to a neighborhood Ug of any one map Tg. 

The Coupling Lemma needs to hold for the compositions in (3). For this reason we cannot use 
the same magnet for all times. Moreover, the stable foliation that we will use to couple points 
(more correctly some of the probability masses carried by these points) on the magnets changes 
with time. This will guarantee that what has aleady been coupled will always remain coupled. 
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Therefore, we need to introduce the following time- dependent magnets: For every g G {1, . . . , Q} 
and every n G /g, define 

DJln = {x E^iq : W" n yig connects the u-sides of and has only one component}. 

In other words, consists of those leaves of W" that connect the u-sides of Dig and are entirely 
inside fRq. 

We say that an unstable curve W crosses OJt.„ properly if n G /q and W crosses 9^^ properly. 
Assuming that the cones C^,^ are narrow enough, Tin is close to Dig, and we have \W fl Tln\ > 

i\wn^g\. 

Lemma 19. Assume that W crosses the magnet 9^^ properly and that W carries the measure 
du = pdmw, where p satisfies (4). Then, for all n E Ig, 

z/(iyn9Ji„) > ]^u{wn^g). 

Proof. By (4), p(y) > p(x)e~'^''l^^^''l'" for any x,y G W \^ SHg. In particular, we can av- 
erage the left side over W fl 9Jt„, and the right side over W fl obtaining p{W fl > 
^-^f^^e~'~^^^'^^'^''^'^' viyV P\'^g). Since SHg has small diameter, the exponential factor is close to 1. 
For n G Jg, we also have \W fl ajt„| >\\Wn^g\. □ 

For a standard family with respect to {C^j.}, Q = {{Wa,i^a)}a&, let Wa,n,i be the connected 
components of TnWa H Tin that correspond to proper crossings, and introduce the notation 

Wa,n,. = rn\l^^Wa,n,i). 

Next, we generalize Corollary 18 of Lemma 17. 

Lemma 20. There exist numbers Sq > 1 and do > 0, such that the following holds. If (Tj) is 
a sequence of the general form described in the Introduction and Q = {{Wa, i^a)}a& o standard 
family with respect to {Ci^} then, for I < q < Q and Ug^i + sq < n < rig, 



>/2l 



In other words, there are time windows for n, such that a significant fraction of the image 
under Tn of the standard family Q lies on the magnet 9Jt„, ready to be coupled. 

Remark 21. Fixing some 1 < q < Q and k E Ig, Lemma 20 can be applied to the shifted 
sequence {Ti)i>k CL^d Q = {(Wa,i^a)}a& (i standard family with respect to {Cq^}- Then, if 
k — l + SQ<n< Ug, as well as if Ugi^i + Sq < n < Ugi for some q' G {g + 1, . . . , Q}, at least a 
do-fraction of its image under Tn,k ^^es on the magnet 9Jl„ as a result of proper crossings. 

Proof of Lemma 20. The s'q in Corollary 18 depends on q. We take So larger than the maximum 
of these numbers over 1 < q < Q. If Ug^i + sq ^ ^ < '^g? and if so is taken sufficiently 
larger than Sq, then the image of Q under Tnq_i+so-s'Q after standardizing the curves becomes a 
standard family with respect to {C^^.}. Applying Corollary 18 to this standard family and the 
map Tn,nq.i+so-s[^+i yields a lower bound on the 7^/ig-measure of proper crossings of Tiq. From 
this we infer a lower bound on the proper crossings of 9Jl„ by the regularity of densities with the 
aid of Lemma 19. □ 



20 



MIKKO STENLUND 



4.4. Coupling step. Consider first two standard families, Q = {{Wa, i^a)} and S = {{W13, 1^13)}, 
consisting of one standard pair each. 

A good fraction of the images TsoWa and TsoWj^ cross the magnet 971^0 properly, so that 
Tsoi^a{^iWa,so,i) = i^a{Wa,so,*) > do and 7^o (Uj W^/3,^*o J ) = ^fiiWfj^so,*) > do- Here i and j run 
through some finite index sets and T^gZ/ is the pushforward of u. 

Recall that, for a curve W, W denotes the rectangle W x [0, 1] with base W, and if W carries 
a measure di^ then W carries the measure dO = dv ® dt. We will construct a coupling from a 
subset of UiWa,so,i to a subset of ^jWj3^so,j and then show that the complements of these subsets 
can be coupled recursively. 

With small preliminary preparations, we can assume that the cardinalities of the index sets 
for i and j are the same, so that we can pair each Wci^so,i with precisely one H^/3,<io,j. Furthermore, 
we can assume that their relative masses agree: 



v, 



,so,i) _ '^l3,so, ,s 



a,so,i\' ' a,so,iJ _ '^p,so,tV ' ' P,so,tJ (15) 

Here v.^s^^i is the measure on W.^so,i and Z,^so = '}2i'^.,soA^-,so,i) = ^.{^.,so,*)- To see that 
no generality is lost making such assumptions, consider a rectangle W with a measure z) on it. 
We can subdivide it into lower rectangles W x J^, where C [0, 1] is an interval. Then, each 
W X Ik is stretched affinely onto W x [0, 1] and equipped with the measure duk = \Ik \ dv. In 
other words, we end up with replicas of W equipped with lowered measures. Such an operation 
on W is measure preserving, because the pushforward of the measure v\wxik under the affine 
map A : ly X Jfc — )■ ly X [0, 1] is precisely Uk- Subdividing the rectangles in the families {Wo,^so,i} 
and {W^/3,so,i} necessary, and relabeling the resulting rectangles, we can tune the number of 
rectangles as well as their relative weights so as to arrive at the convenient situation described 
above. Each rectangle W,^s^^i now comes with an associated affine map A. ^o,* (which is the 
identity if no subdivision of the particular rectangle was necessary). Some of the rectangles will 
have a common curve as their base on the manifold Ai, but this is not a matter of concern. 

For each fixed z, we can couple a subset of Wa,so,i to a subset of Wp^sQ^i as follows. Choose a 
number Ta G (0, 1/2] such that 

■ Zct^so = (16) 

Now, fix i. Omitting some ornaments for the sake of readability, let h stand for the holonomy 
map from Wa^so,i to Wi3^so,i associated with the stable foliation and denote by p. the density 
of z^.,so,i with respect to dmw, ,,^ i ® dt. 

The subset W'^^^^^^ = {{x,t) G Wa,so,i ■ < t < Ta} is coupled to a corresponding subset 

W'fi,so,^ = {(y^s) e Wp,so,i ■■ < s < Tp,i{y)} via the coupling map 6',^^^ : W^,^,^^, ^ W'^^^^^, : 
(x, t) \-> {y, s) with 

y = hx and s = 
Notice, however, that Tp^i is not constant but a function. It is given by the consistency rule 

nAy),,iy) - (IT) 

The expression on the right-hand side of (17) equals the pushforward of the density Tapa under 
the holonomy map, evaluated aXy = hx. This guarantees that the coupling is measure preserving: 
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if / is a measurable function W^^^^^^ 

f{Q',Jx,t))dt 



then 



w' 



/ [/ 



^, , , dmwa 



f{y,s) ds 



Ta Pa{Xj , 



(y) 
Ay) 



f{y,s) di'f}^so,i{y,s). 



w 



We thus have a couphng for each value of the index i, and have therefore managed to couple 
exactly do/ 2 units of mass between the families {1^0,50,4} and {VF^,so,i} via a measure preserving 
map. 

We are now in position to describe the desired coupling map from a subset Wa C Wa to a 
subset W/s cW/s. Define 

Wa = {ix,t) E Wa ■ iJ'soX,K,so,it) ^ ^L,so,i ^ome i}, 
Wp = {{y, s)eWp : {Tsoy,^p,so,is) e W'p,so,i fo^^ some i}. 

The bijective map : Wa — )■ Wp is defined for a point {x,t) G Wa such that {J-'s^x, Aa,so,i^) ^ 
Wa,so,i by the rule 



{y,s) = Q{x,t) 



{.•^ soy 1 ■^fS,so,i'^) ®soi*^*^ soX, AYa,so,i^) • 



Because the affine maps and the couplings 0'^^^ j are measure preserving, also is measure 
preserving; the pushforward of the measure under Q is fisly^r^. In particular, the amount of 

coupled mass equals figiWa) = = '^o/2. Finally, the coupling time function T : Wa — ?■ N 

is defined by 

T{x,t) = So- 

We now have a complete description of how to couple do/ 2 units of mass of any two standard 
pairs. Thus, given two standard families Q = {(Wa, 'ya)}ae% and £ = {(VT/?, i^/3)}^e!B) we can 
couple a subset Wa C Wa with a subset Wjs C Wp for any pair (a, /3) G 21 x ?B, in which case 
we have I'aiWa) = J-'iiiWji) = do/2. Recall that the index sets 21 and 25 carry probability factor 
measures Xg and A^:, respectively. They detail how much weight is assigned to standard pairs. 
Suppose the sets 21 and are finite or countable. Splitting off subrectangles if necessary and 
stretching them affinely onto complete rectangles as described earlier, we can assume that there 
exists a bijection A : 21 — 03 that preserves measure, i.e., AXg = X^. Hence, the coupling map 
can be constructed from a subset Uae^Wa C Ucga^a to a subset U/3gsBW^/3 C. Ui3^<bWi3 so that 
measure is preserved and in particular so that flg{l-)ae<2i^a) = Psi^pe'sWij) = do/2. On Uae<2i^a 
we set T = sq. The map can be constructed also for uncountable families, but we omit the 
details [7]. 
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4.5. Recovery step. Our task is to couple the remaining points of Wa \ Wa to those of Wjs \ 
and to extend 6 and T to all of Wa- We do this recursively. But first we need to prepare the 
uncoupled parts of the images FsoWa and J^so^p so that they also can undergo the coupling 
procedure described above. 

On the one hand, we have the sets FsoWa \ UjW^o.so.i and J^so^p \ ^iWp^so,! which consist of 
several rectangles whose base curves do not represent proper crossings of the magnet OJIsq. In the 
worst case, such a base curve is the excess piece of a longer curve that has crossed the magnet 
QJtsQ properly, but by definition such excess pieces have a uniform lower bound on their length. 

On the other hand, we also have the sets Wa,so,i \ W'^ ,.^ ^ = {{x,t) G Wa,so,i • 'Ta < t < 1} and 
Wp^so,i \ ^/3,so,« ~ {(y^^) ^ ^/3,so,i ■ 'TpAy) < -5 < 1} whose bottom complements were coupled 
already. We stretch each Wa,so,i \ W^Q,so,i affinely onto the complete rectangle Wa,so,i, replacing 
the density pa on it by (1 — Ta)pa- Similarly, we stretch each W^/3,so,j \ W^/3,so,i '^^^'^ ^l3,so,i so that 
each vertical fiber {s : (y, s) G VK/3,so,j5 '^i^Ay) < < 1} is mapped affinely onto [0, 1], and replace 
the density by the density (1 — r/3^j)p^. These transformations are measure preserving; the 
pushforward of the original measure on the incomplete rectangle is precisely the new measure on 
the complete rectangle. Notice that the base curves W,^so,i are quite short — of the size of the 
magnet — but nevertheless have a uniform lower bound on their length. 

In conclusion, a finite recovery time tq will be sufficient for the map TsQ+rQ,so+i to stretch the 
base curves of all the remaining rectangles above to standard length. In fact, some may grow too 
long but can then be standardized by cutting into shorter pieces, as has been discussed earlier. 

We still need to address the issue of regularity of (1 — ra)pa and (1 — Tp^j^pp as well as show 
that r/3^j is actually well defined, i.e., that its values do not exceed 1. 

Lemma 22. Let us take rj^ = rj-^ ^. Then sup rg.j < 1. Taking the recovery time tq sufficiently 
long, the densities (1 — Ta)pa and (1 — Tp^i)pi3 become regular under Tso+ro,so+i- 

Proof. Throughout the proof we assume that Sq is sufficiently large to begin with. 

By (the proof of) Lemma 9, pa and pp are regular densities on the curves Wa,so,i and 
respectively. Since multiplication by a constant preserves the regularity of a density, (1 — Ta)Pa 
is regular. Showing that (1 — Ti3^i{y))pi3{y) is regular requires some analysis. 

Recall that the holonomy map h maps Wa,so,i onto H^/3,so,i by sliding along the connecting 
leaves of the stable foliation W**". Both curves as well as the leaves are inside the magnet. 
Assuming that the magnet is sufficiently small, h is as close to the identity as we wish: given any 
5 > 0, we may assume that \J'h — 1| < 6. This follows immediately from Lemma 11, because it 
can be applied to the /c-step pullback of h that maps T~].^_,^_^_iWa,so,i onto T~].^_i^^-^Wj3^so,i with 
k large and the connecting leaves of yV''^"~^ sufficiently short (shorter than £o) for Lemma 11 to 
apply. 

For each x G Wa,so,i denote y = hx E Wf}^sQ,i- As |W/3,so,i(2/i, 2/2)! = Iwc.{xux2) ^hrfm,y^ ^^ ,, 

(1 -5)|i^„,,„,,(xi,x2)| < \Wp,so,iiyi,y2)\ < i'^ + s)\Wa,soA^u^2)\- (is) 

Observe that from (4) follows easily 



'This fixes the value of Tyr. 
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or 

I .,So,t\ 

By making the magnet small, the values of p. on W,^so,i are thus as close to its average as we 
wish. By (17), 

1 - pp[y) 

where we have recalled (15), (16), ^/3,so = i^p{Wj3^so,*) > '^o, and (18). The right-hand side can 
be made arbitrarily close to |, so that we can take, say, supr/j^j < |. 
From (17) and Lemma 12 it then follows that 

I \n{TpAyi)pp{yi)) - ln(r/3,i(?/2)p/3(z/2))| < | Inpa(xi) - lnpa(x2)| + | \n.J\i{x2) - In J^h(xi)| 

< (Cr + Ch) (1 - 5)'"^'"(''-'"^W/3,so,(yi, 2/2)r'"(''''''^). 

Hence, ln(rg_jp/3) is Holder and then so is In rg_j = ln(rg_jp/3) — Inp^. Using the estimates 
min(a, 6) | In a — In 6| < |a — 6| < max(a, 6) | In a — In 6| a, 6 > 

obtained from the mean-value theorem, 

I ln(l - r/3,i(yi)) - ln(l - TpAy2))\ 

1 - sup Tp^i 1 - sup r/3,i 

< 3(2^ + Ch)(l - 5)"--(''-''*^)|iy;3,,„,(yi,i/2) 



|mm(r;r,»7h) 



A similar estimate is obtained for (1 — T^^ijpp. The Holder constant is too large for the density 
to be regular, but (the proof of) Lemma 9 guarantees that it will become regular after a finite 
number, ro, of time steps. □ 

Finally, at time So + tq, we normalize the measures on all the rectangles to probability measures 
thereby modifying the factor measures (see Introduction) associated with the rectangle families. 
As a result, we have two new standard families that can be coupled just as the original ones. 

4.6. Exponential tail bound. For the standard families Q and the first coupling is con- 
structed at time sq, when enough mass of each family is on the magnet QJIsq: 

pg(T = So) = —. 

After every coupling, there is a recovery period of ro steps, during which curves too short can 
grow to acceptable {i.e., standard) length and densities get regularized sufficiently. After recovery, 
another sq iterations are required to bring enough mass from each standard family on a magnet 
for the next coupling to be constructed. At the moment of the [k + l)st coupling, 

/ig(T = k{sQ + ro) + So I T > {k - l)(so + ro) + So) = y 
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is the fraction of the previously uncoupled mass of each standard family Q and £ which lies on 
the magnet 07tfc(so+ro)+so ^^"^ becomes coupled. Hence, 

flg{T = k{so + ro) + So) = Y ( 1 - y 
This finishes the proof of the Coupling Lemma. □ 

Appendix A. Subspace distance 

A natural notion of distance between subspaces A,B C M^^ is obtained by comparing orthog- 
onal projections to the subspaces in the operator norm: 

distiA, B) = \\Pa-PbI (19) 

where P. are the corresponding orthogonal projections. Notice that 

Pa^b = Pa + Pb, if a ±5. (20) 

We will also measure the distance between 1-dimensional subspaces using the metric 



\l/2 

U.10U ^ ) — liilli II U. — (-/ II — V ^ ^^-1- — ^ I 

u(iA,v(iB:\\u\\ = \\v\\=l 

where 



dist'(/l,5)= min \\u - v\\ = V2{1 - {A, B)f''^ , (21) 

u(iA,v(iB: ||ji|| = ||-u||=l 



{A, B) = max {u, v). 

u(iA,v(^B: ||u|| = ||i)||=l 

Let u & A and v E B he unit vectors such that (m, v) > 0. Then 

\\Pav - Pbv\\ = 1 - {u,v)^ > 1 - {u,v) = ^\\u-vf 

Hence, 

dist'(yl,5) < V2dist{A,By/^. 

Appendix B. Uniform Holder continuity of the (un)stable distribution 

Lemma 23. For all n, the distributions and (see Section 2) are Holder continuous with 
the same parameters, and the latter do not depend on the choice of the sequence (Ti). 

Before giving the proof, we need two auxiliary lemmas. 

Lemma 24. For 1 < q < Q, there exist constants kg E N and < C'^ < 1 such that the 
following holds when Sg is small enough. If each Ti G Ug for a fixed q and if v E T^M. \ j., 
then \\DxTnv\\ > C'^K^\\v\\ for all n > 1 and D^TnV G C^t^^ for all n > kg. These statements are 
uniform in x. 

Proof. \{v e T^M \C^%, we have > ag||i;"||. But = D^f^v'''' e and |K|| > 

C^A^||t;"|| and \\v'\\ > CgA^|K||. We have > C^a^A^^llt;^!! > 2a-'^\\v'J ior n > kg, if kg is 
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sufficiently large. Because WD^Tk^- D^fg'W < CSq, \\D.^rk^v'\\ > c\\v'\\, and \\D^fg''v''\\ > c||t;"|| 
hold with some C = C{kg) and c = c{kq), we have 

> 2a-\\\D,%,v^\\ - CsgWv^m - Cc-'sg) 

> 2a-'\\D,rky\\{l-Cc-'egf > a;'\\D,Tky\\, 

provided Eg is small enough. This estimate shows that DxTk^v G Q'j-^ ^. 

The uniform estimate ||i5^7^f || > Cg\\v\\ holds with some Cg = Cg{kg) < 1 ioT 1 < n < kg. If 
n = kg + m, \\DxTnv\\ > CgAg''\\DxTkgV\\ > CgCgA™||t;||. Hence, we can set C'^ = CqCg/Ag', so 
that WD^TnvW > C^A^II^^II for all > 1. □ 

The next result on linear maps is cited from [6, Lemma 6.1.1] up to notational changes. 

Lemma 25. Let C\ and C\, n & N, be two sequences of linear maps — t- M.'^^ . Assume that 
for some b > and 6 G (0, 1), 

\\Cl-Cl\\<5b-, n>0. 

Suppose there are two subspaces and o/M*'^ and constants C^, > 1, < X^, < fi^ with < b 
such that 

' \£iv\\ <CA: ifv&8\ 
J/:>||>a"V" tfw^£\ 
THgti 

A* 

Proof of Lemma 23. We generalize the case of a single map found in [6] . As the orthogonal 
complements (T^A^)"*" of T^M. in M*^^ form a smooth distribution (TM.)-^ on A^, we first prove 
that the distribution © {TM)-^ is Holder continuous on M. and then deduce the Holder 
continuity of E^. 

Let Px be the orthogonal projection M*'^ — >■ T^M.. It depends smoothly on x. We define 
This map extends D^Ti to a linear map M^^ M^^. Let us also set 



/:„(x) = L(")(r„_ix)---L(i 



X) 



Recall from above that E^ C Cl ^. Setting C = ni<g<Q ^^'^ ^ = mini<g<Q Ag, we have 

\\DxTnv\\ <C-^A-''\\vl veEl, 

by (A2). This translates to 

\\Cn{x)v\\ < C-^A-'Wvl veEl® (TxM)^. (22) 

By Assumption (A4), we can make the cones so narrow that a vector in TxAi perpendicular to 
E'^ lies in the complement of C^ .^.. Setting C = ni<g<Q Lemma 24 thus yields 

ll^x&ll > C'A^llwll, weTxM:w±El 

In other words, 

\\Cn{x)w\\ > C'A^wl wLEl® {TxM)^. (23) 
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Clearly || = ||Z)a,Tj||. For brevity, let us write bi = suprpSup^^j^\\DxT\\ and 62 = 

snpTsnp^^j^\\Dx{D^ToP^)\\,wheTeT runs overUiU- ■ -UKq. Since = L("+^)(7^x)£.„(x), 

\\Cn+i{x) - £„+i(y)|| 

< ||L("+i)(r„a:)|| \\C^{x)-CM\\ + \\L^''-''\rnx) - L^''-''\Xy)\\ 

< h\\Cn{x) - Cniy)\\ + b2\\rnx - XyW hi < hi\\Cn{x) - Cn{y)\\ + - y\\. 
As — C2{y)\\ < h2\\x — y\\, we obtain the bound 

<c||x-2/||6f, (24) 

with the constant c = , .fr 

fei(bi-l) 

The bounds (22), (23), and (24) show that all conditions of Lemma 25 are satisfied, if we 
take C\ = Cn{x), Cl = Cn{y), £^ = ® {T,M)^, and £^ = ® {TyM)^. Writing a = 
21nA/(21n6i + lnA) and K = 3max{C~\C'~^yA^ c"", 

dist(E° © (T^M)^, El © {TyM)^) < K\\x - y^ 

provided ||x — y\\ < 1/c. 

By compactness of the manifold and smoothness of the distribution (TA^)^, we have 
dist((T^A^)^, {TyM)^) < L\\x - y\\ for some L. Hence, by (19) and (20), 

dist(E°,Ej) < {K + L)\\x - yW'' if \\x-y\\ < 1/c. 

Notice that this bound does not depend on the sequence (Tj)j>i. Moreover, the same upper 
bound is obtained for each distribution E^ by disregarding the first n maps and considering the 
sequence (Tj)j>„ instead. The result for F" is obtained by reversing time. □ 

Appendix C. Inclination Lemma type results 

Lemma 26. Fix I < q < Q and let Ti G Uq for each i. There exists a constant > such 
that, for all w G Q.^. and all w G T^J^ with \\w\\ = \\w\\ = 1, the vectors Wn = DxTnW and 
Wn = DaJnW Satisfy 

^<C# Vn>0. 
IknII - * 

// also w G C" , the angle between Wn and Wn tends to zero at a uniform exponential rate: 



1 - 



for some < c < 1 and C > 0. 



Wr,. Wr 



< min(c, CA-^") (25) 



Proof. We first construct "fake" stable and unstable distributions for finite sequences Ti, . . . , Tj^. 
Fix > 1. First, choose a distribution E'^ such that E!^ <Z C^^ for all x. Then define the 
distributions < n < A^, by pulling back: = Dt^^^^T^I^E'^'^^I^^ C C^^. Next, choose 
a distribution F° such that F° C Q .j. for all x. Then set recursively F" = Drp-i^T^ F^-i^ for 

n > 1. We will use F" and F^ as coordinate axes. Let e" G -E^ and G be unit vectors 
oriented so that D^T^z^ and point in the same direction and D^Tnf^ and f^^^! point in the 
same direction. 

Recall from Section 1.3 that the angle between E^^ and E'^^ is uniformly bounded away from 
zero. In other words, there exists a. ipg > such that (F^^,F^^) < 1 — 2?/;^ for all x E Ai. 
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By Assumption (A4), the cones can be assumed narrow enough, so that (f/, V) < 1 — tpg for all 
subspaces U C C^^^ and V C C^^^, and for all x E Ai. Because of this, |(e", f^)| < 1 — ipg for all x 
and all < n < iV. Thus, we have the uniform bounds 

+ /?')< IK + /3e^f < 2(a2 + /32), Va,/3, 0<n<N. (26) 
Now write Wn = Cinfr„x + Pri^Tnx ^^^^ = C(nfr„x + Pri^Tnx ^^^^ estimate, for < n < A^, 
ll^nlP . 2(a^ + ^^) 2~Pl\\a^ry£ ^ '2al , 2^o^(C,A^)-^ 



We have used Assumption (A2) to bound the norms involving D^Tn- From (26), a^-, < 
As w is an unstable unit vector, |ao| is bounded from below by some Ag > 0. Thus 



< -^(l + O = Ci 0<n<N. 



- 1 



But A^ was arbitrary, so the bound holds for all n > 0. In particular, C# does not depend 
on the constructed distributions. A computation also shows that, if both w,w E Q^, then 
1 - |(w„,w„)|/||w„||||w„|| is of order /3„/3„,/a„a„ < C-%-^"73o/3o/aoao < CA"^''. □ 

Lemma 27. Fix 1 < q < Q, < X < 1, and 6 > 0. Fix N > 1 and Ti e Ug for 1 < i < N. 
Take two points Xi,X2 and assume that d(TnXi,TnX2) < 5A" ifO<n<N. Suppose Wi,W2 are 
unstable curves with respect to {Cg^} and that Xi &Wi. Then 

JTnWiTn+ljTnXi) 
>^TnW2Tn+l{TnX2) 

The constants C > 1 and < fi < 1 are independent of N, of the curves Wi, and of the choice 
o/Ti, . . . , T/v, as long as the bound on d(TnXi, 7^X2) continues to hold. 

Proof Choose F° = E^^ and define F^ = D^-iJ'n F^fi^ ioi 1 <n < N. By Lemma 23 \ the 

distributions belong to a fixed Holder class, no matter which Ti, . . . ,T/v and A^ are chosen. 
Notice that F^ C for < n < A^. 

Let u"^ {i = 1,2) stand for a unit tangent vector of TnWi at = T^Xj. We also write 
L(x) = DxTn+i o Fx, where Fx is the orthogonal projection M*'^ T^Ai, which is smooth. 

JTnWiTn+l{x1] 



< C"/i" < n < A^ - 1. (27) 



J%.W2Tn+l{x2, 



1 



< — |p,„r„+i<|| - p,.5r„+i<||| = |||L(x^)<|| - ||l(x^)m^| 

< ^ i\\L{xl) - L(x^)|| + ||L(x^)|| minIK - a<| 



< C(A" + dist'(f/f,?72")). 

We have denoted by Uf" the linear subspaces of MJ^ spanned by m" and recalled the definition in 
(21). The angle between [/f and F^r. decays exponentially: dist'(?7f , /;"„) < CA^^n, (^21) 

and (25). Hence, it suffices to prove exponential decay of dist'(F"n, F"n). But this follows from 
Holder continuity and the assumption (i(x", Xg ) < ^ A". □ 



'^Lemma 23 has been formulated for as defined in (6). Considering the special case Q = 1, we can clearly 
recover the claimed result for F" as defined here. 
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